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Abstract —P2P system rely on voluntary allocation of 
resources by its members due to absence of any central 
controlling authority. This resource allocation can be viewed 
as classical control problem where feedback is the amount of 
resource received, which controls the output i.e. the amount 
of resources shared back to the network by the node. The 
motivation behind the use of control system in resource 
allocation is to exploit already existing tools in control theory 
to improve the overall allocation process and thereby solving 
the problem of freeriding and whitewashing in the network. 
At the outset, we have derived the transfer function to 
model the P2P system. Subsequently, through the simulation 
results we have shown that transfer function was able to 
provide optimal value of resource sharing for the peers 
during the normal as well as high degree of overloading 
in the network. Thereafter we verified the accuracy of the 
transfer function derived by comparing its output with the 
simulated P2P network. To demonstrate how control system 
reduces free riding it has been shown through simulations 
how the control systems penalizes the nodes indulging in 
different levels of freeriding. Our proposed control system 
shows considerable gain over existing state of art algorithm. 
This improvement is achieved through PI action of controller. 
Since low reputation peers usually subvert reputation system 
by whitewashing. We propose and substantiate a technique 
modifying transfer function such that systems' sluggishness 
becomes adaptive in such a way that it encourage genuine 
new comers to enter network and discourages member peers 
to whitewash. 


I. Introduction 

I N recent years, the popularity of the Peer-to-peer 
(P2P) system is on the rise, which is evident from 
fhe traffic measurement records of ISP's HI I2l 13. P2P 
systems follow decentralized network model, where in¬ 
dividual users, also referred as peers perform the job of 
bofh server as well as clienf. Peers share cerfain porfion 
of their resources viz. bandwidth, disk storage or pro¬ 
cessing power with other members. A P2P application 
combines the resources available across the network and 
thereafter peers can assist each other in terms of various 
services like file-sharing [4J, content lookout 0 (Sj [Z] 
[|8], data storage in distributed manner Q, digital content 
delivery ITOl , grid computing IfTTl and content transfer 
Ha. The term node and peer is used interchangeably in 
this paper. 
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The cooperative model of resource exchange in P2P 
network poses fundamental and challenging research 
issues. The potential of the P2P system, especially file 
sharing and grid computing HTI . is dependent mainly 
on the amount of resources contributed by peers. Peers 
usually are reluctant in sharing resources because there 
are inherent costs attached with sharing viz. payment 
for the upload bandwidth. Therefore, natural tendency 
of peers is to free ride i.e. consume resources without 
making any contribution back to the network. In worst 
case, P2P network will fail if nobody shares any resource. 
Several studies done in past have shown the aforesaid 
free riding behavior of the member peers in a P2P 
systems. Study performed on the Gnutella file sharing 
system in 2000 ||T3 showed that as many as 70% of 
its users were free rider and fheir percentage further 
increased to 85% liMil in year 2005. Nearly half of the total 
file search responses in network came from 1% of tofal 
nodes. This leads to congestion in these serving nodes 
thereby leading to the "tragedy of the commons" ifTsl . 
Hence "free riding" and "tragedy of commons" are the 
two important issues in implementing the cooperative 
model of information exchange and require immediate 
attention for increasing the efficiency of the P2P network. 

Many solutions such as micro payment based schemes 
Hbl H3/ game theoretic approaches HHl IIT9l EDI and 
trust or reputation approaches Eli E2| E3l E4l E5l ESI 
have been proposed in the past which provide incentive 
to the cooperating peers to solve the above mentioned 
problems, like. The monetary gain based schemes require 
virtual banks or credible authorities for implementation. 
The mathematical modeling using the game theoretic 
approaches is not accurate H22 1 because of reliance on 
some unrealisfic assumpfions like every node is aware 
of the link capacity of every other node in the network. 
Trust or reputation based approaches being simpler and 
easy to implement are extensively used to encourage the 
cooperation among users to share their resources in the 
network. 

In reputation based systems, the incentive is in terms 
of fhe service differentiation provided to the requesting 
node on the basis of its past co-operative behavior. Co¬ 
operation level is quantified in terms of peers' rep- 
utafion. Many resource distribution protocols such as 
resource bidding mechanism with incentive HD I, rep¬ 
utation based resource allocation [|22^ and probabilistic 
resource allocation | j27) have been proposed in the past, 
which give preference in service to the nodes with 
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greater co-operation level. 

The resource allocation by a node based upon the 
requesters' reputation in the P2P system can be viewed 
as classical feedback control problem. The output of the 
control system is the download capacity that overall 
network offers to the peer. The output of control system 
follows a sef poinf or reference. In our case, fhe reference 
corresponds to an utilization value, which is fixed and 
equal to 1. Utilization = 1 signifies the optimal case for 
the peer where it can utilize whole of its download ca¬ 
pacity with minimum upload contribution.This proposal 
is in accordance with tendency of rational peers 1281 to 
adopt a strategy which provides them maximum benefit 
from the network at minimum cost. Therefore resource 
allocation at every peer can be implemented using the 
control system which always leads to some resource con¬ 
tributes back to network, thereby curbing the peers' free 
riding tendency. The Utilization is explained in detail in 
section rV-AI The difference between observed Utilization 
and the reference is called error. If changes fhe upload 
bandwidth offered by the node which subsequently 
modifies the nodes' reputation, thereby changing the 
bandwidth offered of if by fhe nefwork. The change in 
fhe outpuf is always directed such that it minimizes the 
error. This results in nodes sharing minimum upload 
capacity to get the required download capacity from the 
network. 

The proposed control system can also be used to 
solve the problem of whitewashing [TSj l29l , where 
users leave the current system and rejoin it with new 
identities to avoid low reputation penalties due to their 
uncooperative behavior in the past and exploit the incen¬ 
tives provided by network to the newcomers on joining 
the network. Providing no incentive to newcomers will 
discourage peers to whitewash, however it will also dis¬ 
courage genuine newcomers to join the network. There¬ 
fore incenfive provided to newcomers is made adaptive, 
which decreases with increase in whitewashing level. As 
whitewashing level increases, the control system behaves 
more sluggish i.e. new entrants require more time to 
reach optimal state of resource sharing. 

The main purpose of studying the reputation systems 
using the control theoretical approach is to use the 
already existing tools available in the literature in control 
systems for solving fhe problem of fhe free riding and 
whitewashing. The main contribution of fhis paper are 
listed below 

1) We demonstrate, how the whole P2P system can 
be approximated as linear time-varying model for 
fhe purpose of performance confrol. We also derive 
and describe in detail the various components of 
fhe overall confrol system viz. controller, actua¬ 
tor, plant and monitor. 

2) A resource allocation mechanism is implemented 
using the control system. At every node, control 
system automates the whole process of maintain¬ 
ing a minimum upload bandwidth for getting the 
requisite download bandwidth from the network. 


3) The transfer function of fhe proposed confrol sys- 
fem has been derived. The results obtained from 
if clearly show thaf classical confrol theory can 
address the problem of the free riding. 

4) The overall nefwork performance under varying 
degree of loading at the serving nodes has also 
been studied and simulation results clearly show 
that the proposed control system is able to handle 
these changes. 

5) The equivalent transfer function and the original 
network with the control system were simulated 
independently and the results were compared to 
validate the accuracy of the derived transfer func¬ 
tion. The simulation results clearly verify fhat the 
transfer function closely models the network in 
steady state. Acceptable variations were observed 
during transient state which can be attributed to 
the linearizion l30]l and other assumptions taken 
during derivation of transfer frmcfion. For defails 
please refer to section IV-EI 

6) It has been shown through simulation results shar¬ 
ing less than what is instructed by the control 
system are unable to utilize their complete down¬ 
load capacity. Therefore they they are at loss and 
consequently don't freeride. 

7) The resource allocation performance wifh confrol 
sysfem and fhen wifh fhe existing resource al¬ 
location algorithm (R.A.) [l22| was compared. In 
simulation results it has been observed that the net¬ 
work with control system is able to achieve steady 
state faster and control system appreciably reduces 
the steady state error (i.e. difference between the 
actually shared and the optimal upload capacity). 
This happens due to the PI controller present in 
control system. 

8) Finally, the proposed control system has been 
shown to considerably reduce the whitewashing in 
the network. First, we propose a method to deter¬ 
mine whitewashing level. Afterwards depending 
upon the current whitewashing level, the transfer 
function updates itself which changes the speed at 
which a peer achieves the optimal level of resource 
sharing i.e. Utilization = 1. 

Resf of the paper is organized as follows. The section HJ 
summarizes and compares the related work, whereas the 
system model is described in section Hn] In section IIVI 
reputation management system along with the resource 
allocation is discussed in detail. In section |Vl we model 
the overall P2P network as control system and derive its 
transfer function. The section presents performance 
evaluation of fhe proposed control system in achieving 
optimality and controlling free riding, simulation resulfs 
fo verify accuracy of transfer function and finally com¬ 
parison with the existing (R.A.) [(22) allocation algorithm. 
Discussion about whitewashing problem and its solution 
using control system is provided in section IVlII Finally 
paper concludes with section IVIIII 
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II. Related Work 

Resource allocation strategies to prevent free riding 
have been proposed by many authors in the past. Sat- 
siou and Tassiulas |[22ll put forward Reputafion based 
resource allocation (R.A.), which gives preference to the 
requesting nodes having higher reputation during the 
allocation. Higher reputation implies greater contribu¬ 
tion level, therefore R.A. strategy motivates users to co¬ 
operate. In R.A. strategy requesting peers are sorted 
by the corresponding serving peer in the decreasing 
order of their reputation to demand ratio. The serving 
peer, first satisfies the request of the peer highest in the 
list, afterwards if the resources are left then it will be 
distributed to the next highest peer in the list and so on. 
A peer increases or decreases its capacity in fixed step 
sizes to manipulate its reputation based on the amount 
of the resource it wants from the overall network. The 
model in |22) is limited to the single capacity link 
i.e. sum of the upload and download bandwidth will 
be constant and nodes will decide the partitioning of 
bandwidth between upload and download. Gupta et 
al. proposed probabilistic resource allocation Il27l where 
nodes are selected probabilistically for resource alloca¬ 
tion. Probability of the requesting node getting selected 
for the resource allocation increases with increase in its 
reputation. However the lower reputation nodes still 
have some finite probability of getting selected and 
therefore they don't get completely discormected from 
the allocating node. In both R.A. | (22| and probabilistic 
resource allocation Il27l , the bandwidth is increased or 
decreased in fixed step size therefore it is very difficult 
to achieve the convergence of the upload bandwidth. 
During the steady state upload bandwidth will keep 
on oscillating with amplitude of oscillation proportional 
to the step size. Transient response of the network can 
be improved if the step size is kept variable. Both the 
problems mentioned above are addressed and described 
in detail in section ??. 

Among the other related work 1281 Meo et ah, con¬ 
templated network as market and proposed that sec¬ 
ond auction strategy leads to the optimality. However 
the study is limited to the homogeneous peers with 
the single capacity. Another serious limitation with this 
scheme is that the member node can perform only single 
upload and download at a given time. Ma et al. IT^ 
put forward a modified version of the progressive water 
filling algorithm, where requesting nodes are provided 
resources at the rates proportional to their reputation. 
Ma et al. made an analogy of the resource allocation 
with filling a bucket, where base of the bucket is di¬ 
rectly proportional to the reputation of fhe node. The 
author theoretically proved that such kind of allocation 
maximizes the marginal utility. However author has not 
supported his hypothesis with simulation analysis. Yan 
et al. ISTII presented a theoretical framework for fhe opti¬ 
mal resource allocation. Peers' ranking and the utility are 
the basis for the resource allocation such that resource 


distribution achieves max-min fairness. The theoretical 
analysis done in IT^ 1311 considers only stationary com¬ 
peting peers. D 5 mamics of compefing peers like variation 
in the demand of requesters is absent in their study. 

Various papers have suggested array of techniques 
in past to prevent free riding in P2P network. Direct 
reciprocity schemes like "tit for tat" 1821 employed in Bit 
Torrent file systems 1881 encourage co-operative behavior 
among the member peers. In direct reciprocity schemes 
service provided by the node i to the node j will solely 
depend on service j has provided to i in the past, 
while completely ignoring j's contribution to the other 
users in the system. However such kind of schemes are 
effective when members remain in the system for long 
time so that there are ample opportunities for recipro¬ 
cation among them. In the current P2P networks where 
there is large population size, high churn rate 1811 and 
infrequent repeat transactions, the reputation systems 
are more effective than the direct reciprocity schemes. 
Feldman et al. ITSl gave the concept of the generosity 
of a node, which is the ratio of the service provided to 
the service received by the node from the network. Based 
upon the estimated generosity, the node will be imparted 
service from the network. However no mathematical 
analysis was provided by Feldman et al. for countering 
the problem of the free riding in the P2P network. In the 
same context Kung et al. 1851 advocated that nodes' usage 
of the resources and contribution back to the network 
should be the basis of fhe selecfion of fhe nodes for the 
resource distribution. Nodes are expected to contribute 
above a certain threshold level to be eligible for the 
resource allocation. Banerjee et al. 136| proposed that 
serving peer will compute the expected utility function 
of the requesting nodes. The decision to share by the 
serving peer will be based upon this expected utility 
function. Gupta et al. 12T1 put forward a reputation based 
system that takes into account various uncertainties at 
the serving node to arrive at more accurate estimate of 
fhe reputation. It also carries out the exponential moving 
average of the present and past reputation values to pro¬ 
vide appropriate weightage to the past behavior of the 
node. However solution proposed in [181 ESI EH lEI 
do not discuss about mechanism to distribute resources 
among the requesting nodes. 

Friedaman et al. 187 1 pointed out that whitewashing 
takes place in P2P networks due to availability of the 
free identities or cheap pseudonyms. Many authors l38l 
1891 in the past have suggested that whitewasher in 
a network can be completely removed by assigning 
permanent identities to its members. However John R. 
Doucher llOl asserted that it is possible for members in 
decentralized network like P2P networks to have mul¬ 
tiple identities. Therefore P2P network suffers from the 
problem of free riding. Yang et al. BTl studied Maze file 
sharing system and came to conclusion that incentives 
encourage whitewashing. Friedaman et al. 1821 proved 
that considering every new comer as a potential white- 
washer is always a better policy over any other static 
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Fig. 1. The P2P Network Model 

stranger policy for avoiding whitewashing. Grolimund 
et al. 1421 proposed to assign low initial reputation to the 
new comers in the network for preventing free riding. 
However assigning low reputation to newcomers will 
discourage legitimate newcomers from joining the net¬ 
work. Therefore initial reputation needs to be adjusted 
periodically on the basis of level of whitewashing so 
that there are enough incentives which motivate new 
users to join the network and at the same time degree 
of whitewashing can be reduced. 

A control system is proposed in this paper to avoid 
free riding for both stationary and dynamically compet¬ 
ing peers. It employs PI controller f30l which reduces 
the steady state error of the P2P system and at same 
time helps the system to achieve steady state faster. On 
modifying the initial reputation (which is granted to the 
newcomer at the time of joining the network) on the 
basis of the level of whitewashing, the control system can 
successfully counter the problem of the whitewashing in 
the network. 

III. P2P Network Model 

We analyze a unstructured P2P network of N nodes, 
where nodes consume and share the bandwidth on 
demand. There is no centralized server and every peer 
acts as both client and server concurrently. Each peer in 
the network is connected to backbone network through 
an access link. In this way peers are connected to each 
other as shown in Fig. [T] 

In this paper we study an overloaded network as 
considered in | (22j Ii43l where at any given node, there are 
always requests pending to download data and content 
are present for downloading. Segmentation of content 
into chunks and their distribution across the network is 
out of scope for this paper. 

The network is assumed to consist of symmetric links 
only i.e. for a link between two nodes i and j, the feasible 
data rate and the bandwidth required to support it, is 
same in both the directions. Download requirements at 


the node i are assumed greater than or equal to maxi¬ 
mum download capacity of the node i. This assumption 
is taken to model the worst case of maximum overload¬ 
ing in the network. It is further assumed that the total 
upload bandwidth available at the node i is sufficient, 
such that when it is bartered with other requesting nodes 
in the network, the download requirement at the node i 
will always be fulfilled. 

For the purpose of modeling, time is divided into dis¬ 
crete time slot. Each time slot signifies a period. We have 
considered unstructured P2P network for simulation but 
the proposed control system is applicable for structured 
P2P networks also. However we have not performed any 
simulations to substantiate this claim. Every time slot is 
termed as a round. The nodes send the requests at the 
starting of the rormd and if they gets selected for the 
service allocation then their request are fulfilled by the 
serving nodes within the same round. For the next round 
the whole process is repeated again. 

IV. Reputation model 

Any P2P network is meaningful only when nodes 
cooperate and fulfill each others' requirements. Network 
under consideration, contains selfish peers who are ra¬ 
tional i.e they are interested in maximizing their own 
benefit without caring about the other members. Selfish 
nodes never want to contribute back to the network. 
Therefore they will always try to become a free rider, 
unless their is some incentive attached in sharing of their 
resources. This leads to the overloading at the few nodes 
which are contributing back to the network also known 
as tragedy of commons. In the worst case, the network 
will collapse when every node becomes a free rider. 
For avoiding the problems of free riding and tragedy of 
commons, a reputation management system can be used 
in P2P network. Every node's reputation is evaluated at 
the end of the each round based upon its performance 
in that round. 

A. Reputation Calculation 

The trust is measure of the cooperative behaviour of 
the node. The trust model used in this paper is similar 
to the one proposed by Satsiou and Tassiulas Il22l . The 
trust calculated by the requesting node j for the serving 
node i during the period is defined as the ratio of 
the bandwidth (r" ) received by the node j to what it has 
actually demanded (B^ from the node i during the n^^ 
period i.e. 

yfl 

f” = (1) 

;; gii W 

For ease of reading, the important notations are listed 
in Table HI Reputation of the node i for the (n -i-time 
slot is calculated using the exponential moving average 
Bill of reputation up to the n^^ period with the average 
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TABLE I 

Notations with their Description 


Notation 

Description 

R" 

Reputation of node i at the end of 11 “ time slot 

VI 

n 

Trust of i as measured by j for ii'" time slot 

r- 

P 

Bandwidth received by j from i for time slot 


Bandwidth demanded by j from i for n”' time slot 

gZl/iT.V 

The maximum download capacity of the node i 

1" 

No. of requests received by the node i in slot 

Si 

No. of requests generated by the node i in slot 


The minimum amount of bandwidth required to 
support feasible data rate practically possible on the 
link between i and j 

mm 

The minimum value of the bZ"'*, among all the 
possible values of i and j in the network 

u. 

^lovd 

Proportionality constant for handling change in 
bandwidth received by i due to overloading at j 

A" 

Set of serving nodes for the node i during n”' period 

5/ 

Set of requesting nodes, requesting resources from 
node i during n‘'' period 

^min 

Reputation threshold below which node does not 
receive any service from network 

W” 

Level of Whitewashing during n'" period 

N" 

Total nodes in network during period 

^max 

in 

Maximum initial reputation that can be assigned 
to newcomers 

R" 

in 

Initial reputation assigned to newcomers during 
period 


of trust values for the current time period as give below 


n+1 


= (a)R” + (1 - a) 


L r 


( 2 ) 


In the above equation is the reputation of the node 
i at the end of the (n + 1)®* period and is the set 
of nodes which have requested the node i for resources 
during (n + period. The cardinality of the set 

i.e ( L 1) is equal to Here is the number of 
j€S" ' * 

the requests received by the node i during the {n + 1)®* 
period. Using reputation aggregation algorithm |[26ll 1451 , 
reputation tables are updated periodically by every node 
so as to maintain identical reputation tables across the 
network, a determines how the exponential average 
forgets the contribution of the past values and maintains 
its value in between [0,1]. Higher a discounts the older 
observations faster and a = 1 completely ignores the 
older values. We take value of a as | in the equation 
(|2) so that equaQ priority is given to the present as well 
as past reputation values.Therefore the reputation of the 
node i in this paper is calculated as 


R” + 


RT^ = 


S tji 
/sSpl 

jn+1 


(3) 


^In this way nodes are discouraged to abruptly change their behavior 
during one time period so as to gain advantage in the reputation for the 
future transactions. The nodes need to be co-operative over a period 
of time for getting better quality of services than the existing services. 


The reputation of a node is the measure of its cooperative 
behavior. The service or bandwidth allocated to a node 
in a network increases with the reputation of the node 
and vice versa. Therefore for acquiring more download 
capacity from the network, the node has to share more 
bandwidth towards the network. 


B. Reputation Based Resource Distribution 

The resources are distributed among the requesting 
nodes using weighted water filling algorithm |[l9l . In our 
proposed model, the reputation of the node is used as its 
weight during the resource assignment. The source node 
distributes the resources among the requesting nodes at 
the same time but with rates equal to the their reputa¬ 
tion. If reputation of a requesting node is below a certain 
threshold say Rmin then it will not be allocated any re¬ 
source. This is done to discourage malicious behavior of 
nodes. During the resource allocation, a requesting node 
i will be taken out of allocation process if either resources 
at the source gets exhausted or assigned resources to z 
become equal to what z had demanded for the current 
round. Let 1,2....M be the M requesting nodes sorted 
in non-decreasing order of their Rk and bj^ are the 
reputation and bandwidth requested by the node k. The 
bandwidth allocation is done in the order as per the 
sorted list. Let fj; denote the bandwidth allocated to node 
k. It is given as 


h = min 



Rk{Ws-Lfd 

!=1 

M 

LRj 

j=k 


■; 


k = i .,M. (4) 


Here Ws is the total upload bandwidth of the serving 
node. Suppose b - [1,1.5,2,5] is the bandwidth de¬ 
mand of the requesting nodes in Mb/s from the source 
with upload capacity of IMb/s. Let the reputations of 
the requesting nodes be R = [0.625,0.25,0.625,1], then 
the bandwidth allocated in Mb/s using weighted water 
filling algorithm is given by = [1,0.8,2,3.2]. 

The bandwidth allocated to any requesting node z 
from i during the (n + 1)®* time period is proportional 
to its reputation R” in the previous perioq 3 bandwidth 
demanded from j and a proportionality constant 
which caters for change in received bandwidth due 

Ucvd . ° 

to overloading at j. Therefore bandwidth received by z 
from j during (n -I- 1)®‘ period is given 


pi+i 

<7 


= UfixRfxBf+i. 

Uovd ® '/ 


(5) 


The value of k!D^ lies in between [0, ^]. = 0 denotes 

the worst case of overloading at serving node j, where 
the receiving node z will not receive any bandwidth. 
When Zc"U = -U then the bandwidth requirement of node 


^The reputation of the requesting node during current round will be 
evaluated at the end of time period and will be used as weight in the 
next round. 
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i will be completely satisfied by the node j. The value 
of proportionality constant kjj^^ varies in each round 
depending upon the degree of the overloading at the 
serving node. At the same time for another requester 
say X, requesting the same serving node, the value of 
^^L,d vary depending upon its reputations. 

The total bandwidth received by the node i during 
(n + period is given by 

Q(n + l) = Krx Y, (6) 

yeApi 

where is the set of the nodes from which node i 
has demanded resources during the {n + roimd. We 
assume reputation aggregation so that R/” estimated is 
same for all the nodes. 

V. P2P NETWORK MODELING AS CONTROL SySTEM 

The objective of the control system is to enable each 
peer to maintain the minimum upload so as to maintain 
the just enough reputation, to get the desired download 
capacity. The control theoretic modeling can improve 
the performance of the reputation management system 
as a peer can adapt and derive maximum utility from 
the network. The design of a control system involves 
determining the equivalent components i.e. controller, 
actuator, plant and monitoring unit. A mathematical 
model has been developed for this purpose. Finally the 
overall performance of the devised control system model 
need to be evaluated to determine the amount of the im¬ 
provement achieved. The additional overhead incurred 
in this process also needs to be carefully analyzed. The 
following subsections discusses the set point determina¬ 
tion and various other components which constitute this 
control system. 

A. Determination of the Set Point 

The proportion of the requesting peer i's demand 
which is fulfilled by the network in the round in 
comparison to its download bandwidth is given by 

m (7) 

i 

Here Cfn) is the total data rate agreed by all the serving 
peers to be given to the requesting node i and is the 
maximum download capacity of the requesting node z 
from the nodes in the network .The serving peers will 
never give wrong information about the Ci{n) because 
their reputation is calculated on the basis of actual data 
rate delivered and not the data rate they are willing 
to provide i.e. Q(n). We assume that all the peers are 
rational, therefore there are no malicious peer. When a 
peer is assumed rational then it will not act maliciously 
|[43l , as rational peers gain nothing by harming other 
peers. Therefore a node will not receive incorrect C,(n). 

When the requesting node i increases its upload ca¬ 
pacity then its reputation increases consequently making 


i eligible for more download capacity Ci(n) from the 
network. Hence Ci(n) of the requesting node increases. 
Three cases may arise in the equation (0. When Q(n) < 
ptruix satisfaction derived by the peer i from the 

network increases with increase in C,(n). The satisfaction 
attains its maximum values at C,(m) = B™* i.e. !T,(n) = 1. 
After Ui(n) = 1, higher value of C,(n) will actually not 
be useful to the node because it exceeds the maximum 
data rate the requesting node can handle. Node is at loss 
when Ui{n) > 1 because it has to pay more in terms of the 
greater upload capacity for the download rate limited 
by Bj"“. Therefore to derive the maximum benefit from 
the network, a node should maintain Ufn) = 1, while 
keeping the upload rate to minimum, so that it can full- 
fill all of its download requirement at the minimum cost. 

Hence the set point or reference point for our control 
system is Uref = 1. The feedback monitor in the loop will 
estimate fimction Ufn); this value will be compared with 
the reference point (Href = 1) and the error is fed to the 
controller. It is assumed in the system model proposed in 
section[TJ that every node i has adequate upload capacity, 
such that when it is bartered with other requesting nodes 
in the network, then download requirement of node i 
will always be satisfied. In other words upload capacity 
available at node is adequate enough to make the Ufn) 
at the node i equal to the Uyef of the control system. 

The Ui{n) discussed above is different from the concept 
of utility 1461 . The utility corresponds to the degree of 
the satisfaction received by the node from the network 
whereas Ufn) is dependent upon the total resources that 
are available to the node from the network. Some time 
the nodes' capacity may not be sufficient to utilize all 
the resources. Therefore these extra resources which are 
considered in Ufn) are useless for the node and don't 
contribute in overall satisfaction and consequently in the 
utility derived by the node. 


B. The Controller 

The controller 1301 stabilizes the system output to a 
particular value called set point. It compares the dif¬ 
ference between the desired and the actual U in the 
proposed control system. Based upon this comparison, 
the controller drives the actuator to regulate the output. 
Generally PID controller is preferred as a controller in 
control system. However when the P2P network reaches 
the steady state, nodes generally share in proportion 
to their requirement so as to preserve their reputation. 
Therefore total shared capacity in network does not 
change abruptly and consequently so does U. The differ¬ 
ential action is used to counter sudden changes in output 
thus it is not required in our system. Therefore the PI 
controller with transfer function G(s) = Rp(l + 7 ) Il30l is 
used in this paper to model the control system. Kp and 
Ki are the proportional and integral gain respectively of 
the PI controller. 
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C. The Actuator 

The role of the actuator 1301 in the control loop is to 
update physical entity based upon the controller output. 
In our case, the actuator modifies the upload capacity of 
node in response to the controller output y. If y is the 
input to the actuator then yA is the the upload capacity 
shared by the node in the next round. The change in the 
upload capacity by the actuator modifies the reputation 
of the node, thereby adjusting the download data rate, 

O.WflX 

the node receives from the network. A = is taken in 
accordance with the existing allocation algorithms 1221 
l27 l for ease of comparison with them. 


D. The Monitor 


The function of the monitor at the node i is to sense 
the current data rate Q(n) offered to the node i, for 
evaluating the Ui(n). The evaluated U,(n) is compared 
with the set point to calculate the error and drive the 
controller which in turn decides how much resources 
are shared by the node i to the other nodes. Let (yA)"^^ 
corresponds to amount of resources allocated by the 
node i to the requesting node j during (n+1)®^ round. The 
trust calculated by the requesting node j for the serving 
node i in the (n + 1)®^ round is given by 


(yA)”+i 

j«+i ^ 'P 
ji gii+1 

i‘ 

Using equation ([3) the reputation of the node after the 
(n + 1)®* period is given by 




R" + 


Z 

/eSpl 


(yA)^+l 

B"+l 

P 


jn+1 


2 


( 8 ) 


The symbols have been defined along with the equation 
©. The total bandwidth received by the node i during 
(n + 2)^^ time period is calculated using equation ([Sll as 


Q{n + 2) = Rr^x Y, 

k€A"*^ 


Putting the value of from equation (|8) into the above 
equation we get 




iH+l 


R" 


/ 6 Sp 


Q{n + 2) = 


jn+1 


■L 


l-n+2y.D«+2 


(9) 


The serving peers allocate resources in proportion to the 
demands of the requesting peers.The requesting peers 
may receive much less bandwidth then what they had 
requested because of the overloading at the serving peer. 
In order to optimize for themselves, the requesting peers 
will demands more than their actual capacity. Normally 
the requesting peer should be requesting whatever is its 
download capacity. To overcome the problem of nodes 


demanding more than their actual capacity, the serving 
peers can estimate the feasible capacity of the links to the 
requesting peers and use it instead of what is demanded 
if the demand is more than the feasible capacity. Feasible 
capacity is the minimum amount of the bandwidth 
required to support the feasible service rate across the 
link. Feasible service rate is the maximum achievable 
throughput via underlying path in the network with 
packet loss probability p. It can be estimated for the TCP 
Reno algorithm 1471 as 


R{p> - 

RTT* 



M 


ToMm(l,3 



)p(l + 32p2) 


Here R(p) is the feasible service rate which is the function 
of the packet loss probability p, M is the maximum 
transmit window that the receiver indicates to the sender 
and RTT the round trip time between the two nodes. Tq 
is the retransmission time out in seconds and b is the 
number of packets acknowledged by each acknowledg¬ 
ment message. 

The serving peer will never give more than feasible 
bandwidth as it knows that remaining bandwidth will 
be wasted. Hence the equation Q becomes 




R” -I- 


Ci{n + 2) = 


R” -I- 


Z 

jn+1 


2 

(yA)' 


E 


k&A"*^ 


!«+! 




jn+1 


xB" 


'E 

keA"*^ 


.?j+2 

ikovd 


( 10 ) 


Here Bj”“ is the maximum download capacity of the 
node i and B^“ is the minimum amount of the band- 
width required to support the feasible data rate, as 
derived from the equation ifTOt , on the link between node 
i and j. We assume symmetric links in the system model 
in the section [Till Therefore = B^.“ i.e feasible data 

ji t] 

rate while transferring data from node i to j is same as 
that from j to i. 

Using equation lO, the measured variable U, corre¬ 
sponding to the node i for the (n -t- 2)*^ time period is 
given by 


(yA)”: 


Ui{n + 2) = 


C,(n + 2) W 


R” + 




jn+1 


B“ 


:E 

k&A"*^ 


p »+2 ^ 
^^ovd 


( 11 ) 


E. The Controlled Software System Model 

Fig. |2] shows the block diagram of the control system 
which measures the data rate received by the node 
and maintains it at the desired level by regulating the 
upload bandwidth of the node. Let H(s) denotes the 
controlled software system (which includes actuator and 




















Upload bandwidth 



Fig. 2. The Block Diagram of Control System at the Peer 


G(s) 


H(s) 


Fig. 3. The control loop 


P2P network) as shown in Fig. |3l For simplicity we 
neglect the process dynamic^ and consequently H{s) is 
equal to static gain h. On linearizing 0, the gain h l(30l 
around the reference point (lire/ = 1)/ we obtain gain as 
the derivative of the output (U;) with respect to the input 
(y) as shown in the Fig. |2l Using equation dTTb . the static 
gain of the requesting node i is given by 


h = 


dU,{n + 2) 
dy 


R" 


/eS; 


(yA)'-- 


X 


dy 


( 12 ) 


The maximum gair0 occurs when = 1 i.e. node 
i receives only one request during given time slot and 
= 5"^“ . Where is the minimum value of the 

]i min mm 

among all the possible values of i and j, i.e. 


= min B^.“. 

min I j p 


Another factor as discussed in section HWB] attains 
its maximum value when reputation of node is mini¬ 
mum. Rmin is the minimum value of the reputation at 
which a node is eligible for resource allocation in the 
P2P network . Therefore the maximum possible value of 
{kV'^) such that node can receive data from network is 

J^ovd' 

. Hence the equation ((T2t reduces to 


h 


max 


min 

V 1 V V 1 

2 

^ R ■ Zj ^ 

A 



dy 


(13) 


Yj 1/ represents the cardinality of the set and 

t 

is equal to the number of the nodes from which 
resources have been requested by the node i during 
(n + time slot. As peers are rational so to maximize 
their chances of getting content from the network they 


^Process dynamics play significant part during transient phase of the 
system. We assume network has reached steady state and its behaviour 
is virtually constant with time 

^If any non linear system involves small signals and operates around 
the equilibrium point then it can be approximated as linear system 
within a limited range of the operation. 

®The system is designed for the maximum gain. If system is stable 
for the maximum gain then it will also be stable for the lesser. 


will try to generate maximum amount of data requests 
hence gp^ - gmax ■ Therefore equation lIT^ becomes 




ktnax — 




dy 


The maximum static gain hmax on differentiation is 
given as 

(14) 

? R • R-^ 


kiniix — 


Change in upload capacity induced by actuator is ob¬ 
served in the next period. This results in dead time 
(delay) of one period T. Therefore the overall transfer 
function of the controlled system is given by 


H(s) = hmax-e 


The equation ifTS does not contain any term related to 
the node i so it is a generic expression applicable to any 
of the receiving node. Thus, the transfer function of the 
controlled software system is given by 


H{s) = 


( A? ' 

^^max 
01? ■ 


.-sT 


(15) 


The overall transfer function of the control loop as 
shown in Fig. |3] is given by 

T{s) = H(s)G(s) = hmax.e-^'^Kpi + (16) 

s 

1) Deriving Model Parameters: When the frequency of 
the operation is the natural frequency of oscillation, 
then the transfer functior0 of the control loop given by 
equation dl^ becomes 

h„ax.e-’^’'^Kpi + A) = -1. (17) 

JZVfi 

The gain margin G specifies the stability of the control 
system and is set by the designer. Therefore using equa¬ 
tion the gain and the phase equations for closed 
loop at natural frequency becomes 

h„ax\{e->^"P{Kp{l + ^)| = i, (18) 

JlVn G 

and ^ 

WnT + tan~^(—) = n. (19) 

Wn 

^We substitute s = jw in the overall transfer function of the control 
loop 
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TABLE II 

Simulation Parameters along with their Values 


Parameter 

Description 

Value 

N" 

Number of the nodes in 
network during period 

100 

Kp 

Proportional Gain 

3.4641 X 10“^ 

Ki 

Integral Gain 

1.51149 

G 

Gain Margin 

10 

Smax 

Maximum number of requests 
generated by the node in 
a given time period 

2 

a 

Exponential moving 
average constant 

1 

2 

j^max 

Maximum initial reputation that can 
be assigned to newcomers 

0.01 

R” 

in 

Initial reputation assigned to 
newcomers during period, except 

during whitewashing scenario , 
where its value is calculated 
using equation 1211 

0.01 

^tnin 

Reputation threshold below which 
node does not receive any service 
from network, except for 
whitewashing scenario simulation, 
where its value is equal to R", 

0.01 

A 

Step Size 

0.5Mbps 

min 

The minimum amount of bandwidth 
required to support feasible data rate 
practically possible on the link 
between nodes i and / 

2Mbps 


In industrial PI controller it is common practice to tune 
the controller phase i.e to Esl HU, hence 

taking 

( 20 ) 

ZUfi D 

The parameter G and T are set by the designer and h„ax 
is calculated using equation dlSll . Using equation (l20ll in 
we get the value of u>n as 

571 2.618 

= -^f^rad/s. 
bl 1 

The parameter Ki of controller is calculated by substitut¬ 
ing above computed value of iVn in equafion ll20ll . The 
simple algebraic manipulations gives 

j. 1-512 
' T ' 

The above computed values of Wn and Ki are used in 
equafion dlSll for obtaining the expression for Kp as 

_ 0.866 

~ Ji c' 

VI. Performance Evaluation 

The current section examines the performance of pro¬ 
posed control framework through simulations.The var¬ 
ious simulation parameters along with their values are 
listed under table [III The values of parameters Kp, Ki, and 
a are evaluated from the equations given in this paper 
and parameter G's values is taken from Il49l and the 
remaining parameters value are taken from t22ll for the 



Fig. 4. Control System performance in a P2P network 

ease of comparison. During simulation time is taken as 
discrete and each discrete time slot signifies a period. The 
network gets initialized at time T = 10. Before time T = 10 
the Utilization of every node present in network is 0. 
The simulation analysis of the transfer function, actual 
P2P network and the RA f22\ an existing state of arf 
algorifhm has been imdertaken in this section. 

A. Study of Control System Performance 

Fig. |4] demonstrate how the control system is able to 
assist a node in the P2P network in attaining a stable 
value of the utilization around the reference point i.e 
Lire/ = 1. Initially the utilization of a node will be 0 as 
if has not shared anything, afterwards it changes and 
stabilizes aroimd Uref- To achieve the target utilization 
(i.e Uref) the node adjusts its upload bandwidth, which 
causes the change in the reputation of the node further 
leading to the change in the download bandwidth re¬ 
ceived by the node. The experimental results show that 
above discussed adjustments pushes the utilization of 
fhe node to the target utilization. 

When P2P network enters steady almost every user 
has settled down in terms of demand, therefore re¬ 
quirement of the user doesn't change abruptly. Conse¬ 
quently download capacity received by a node for a 
given amount of shared upload capacity remains al¬ 
most constant. This fact is also demonstrated by the 
simulation results in Fig. |4] where Utilization remains 
around 1 during steady state operation. For checking the 
performance of proposed control system during worse 
case scenario we artificially induce high level of loading 
in the network during steady state. Foading at serving 
node is determined by number of requests coming to 
a serving node. Overloading implies that amount of 
bandwidth demanded from the serving node is more 
than the what it has shared. Due to variation in loading 
the total download capacity promised by the serving 
node changes with time. The amount of bandwidth 
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Sampling Point 


Fig. 5. System performance under artificially induced loading in P2P 
network 



Fig. 6. Transfer Function comparison with actual network 


received by a node is directly proportional to the down¬ 
load capacity promised by the serving node. Therefore 
increase in the level of loading at the server results in 
the receiving node getting promise of lesser bandwidth 
and consequently smaller resources for the same level 
of the reputation. Simulation results in Fig. clearly 
demonstrate that fluctuations in the level of loading at 
the serving node is successfully handled by the proposed 
control system model. 

B. Veracity evaluation of the Transfer Function 

To verify and show that the derived transfer function 
in section EE] closely models the proposed P2P network, 
we simulate the various components of the control sys¬ 
tem i.e. controller, actuator, plant and monitor. Thereafter 
results obtained for both the systems discussed above are 
compared and presented in Fig. |6l 

While deriving transfer function in section IV-EI we 


assumed that system has already attained steady state. 
Therefore the steady state response is same for the 
both, whereas there is slight variation in the transient 
response. We had considered worst case during the 
derivation of transfer function i.e. maximum overloading 
at the serving node. This results in a sluggish transient 
response of the transfer function compared to the actual 
system. 

These small deviations in the systems' transient re¬ 
sponse obtained by the transfer function, from the actual 
system's response can be tolerated because almost all 
P2P systems most of the time operate in steady state. 
At the same time, without linearizion 1301 (i.e assuming 
system operation in steady state), actual system analysis 
becomes very complex due to consideration of non- 
linearities. Por considering non-reality, we need detailed 
information about the network parameters such as link 
capacities. Consequently it will be very difficult to obtain 
a generalized control system model for the P2P network. 

VII. Whitewashing 

Whitewashing in a network can be reduced by pro¬ 
viding low initial reputation 143 to the newcomers. 
However this will also discourage newcomers from 
joining the network. To make P2P system lucrative for 
newcomers, we need to increase initial reputation at the 
cost of higher level of whitewashing. The contradictory 
requirement for both high level of initial reputation for 
newcomers and low level for controlling whitewash¬ 
ing can be taken care of by making initial reputation 
adaptive i.e. it gets modified on the basis of the level 
of whitewashing in the network. When level of the 
whitewashing in the network is very high, we can 
decrease the initial reputation to counter whitewashing. 
If level of whitewashing is low than initial reputation 
to newcomers can be increased so as to motivate new 
comers for joining the network. 

The level of whitewashing in the network is calcu¬ 
lated by observing the departing nodes. There are two 
types of departing nodes in the network. One who have 
reputation greater than or equal to the initial reputa¬ 
tion provided to the newcomers in the network. These 
nodes will definitely not whitewash because they do not 
gain any advantage by whitewashing. The remaining 
departing nodes whose reputation is less than the initial 
reputation provided to new comers are the prospective 
whitewashers. We define level of whitewashing W"“^ 
during (n - period as the number of departing 
nodes in that period which can be possible whitewashers 
(i.e. have their current reputation less than the initial 
reputation). The initial reputation during the period, 
on the basis of level of whitewashing can be calculated 
as follows 


where are the number of nodes during (n - 1)*^' 

period. The value of is taken as 0.01 based on earlier 
work 1221 on whitewashing. 


W 


11-1 


N' 


fl—l 


Rl 


( 21 ) 
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Fig. 7. System performance for different level of whitewashing 


To discourage nodes from leaving the network, the 
threshold below which node becomes ineligible for ser¬ 
vice i.e. Rmin is made equal to the initial reputation of the 
joining node The reputation threshold is calculated 
at the beginning of every period. The change in Rmin 
modifies the gain h^ax of the controller as shown by 
equation ifT^ . The node if disregard the formulae and 
parameter value, then can choose different hmax value, 
though it may not be optimal for it. From the Fig. [Zl 
it is evident that as the initial reputation awarded to 
the node is made smaller, the system becomes more 
sluggish while discouraging the nodes to leave the net¬ 
work. Sluggishness signifies delay incurred by a node 
in reaching the optimal level of the resource sharing. 
More sluggish the system is, greater the time for which 
the newcomer nodes' Utilization remain less than 1. If 
Utilization < 1, then the node will be at loss, because 
it will be getting lesser resources from the network 
than what it can actually consume. However if white 
washing level is lower in the network than the value of 
the initial reputation awarded can be increased so that 
system becomes faster and genuine new nodes entering 
the system are not unnecessarily penalized. 


VIII. Conclusions 

In this paper, we have used the control theory to solve 
the problem of freeriding and whitewashing in the P2P 
networks. We have derived the transfer function of a P2P 
network which provides a more structured and efficient 
approach to analyse the P2P networks through use of 
already existing tools in the control theory. We were 
able to automate the whole resource allocation process 
using the control system. We have also demonstrated 
through simulation that the proposed system is able to 
help the nodes in sharing optimal level of resources even 
in the case of high degree of overload in the network. 


The nodes sharing less than what control system recom¬ 
mends are at loss because they are not able to utilize their 
total download capacity which discourages the nodes 
to freeride. Further, we have used PI controller which 
improves the overall allocation process. The proportional 
action helps a node in reaching optimal value of re¬ 
source sharing faster, whereas integral action reduces 
the steady state error. These improvements are clearly 
visible when compared with the already existing RA 12211 
algorithm. Finally, through the simulations, it has been 
demonstrated that proposed system can also be used to 
control whitewashing. 
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